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ABSTRACT 

This paper examines research on classroom programs 
for elementary school students who are at risk for learning problems* 
The full range of alternative classroom organization models designed 
to meet the needs of low-achieving or heterogeneous classes is 
explored in an attempt to discover which type of program is mo.t 
effective and why. The goal of this study is to determine how the 
education needs of all students can be met by fundamentally 
restructuring the regular classroom, as opposed to adding on s^ervices 
outside of the regular classroom. Therefore this review focuses on 
the specific, comprehensive programs beneficial to the achievement of 
students who are at risk and that are replicable at other schools. 
Research demonstrates that effective classroom programs accommodate 
instruction to individual needs while maximizing direct instruction, 
and assess student progress fn^guently through a structured hierarchy 
of skills. Two cat'^gories of programs emerged as particularly 
effective: continuous progress and cooperative learning. An extensive 
list of references is included. Descriptions of several types of 
successful programs are appended. (PS) 
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In recent years, there has developed a growing dissatisfaction with the 
educational services provided to students who are "at risk" for developing 
learning problems or school failure. Much of this dissatisfaction has 
centered on the practice of pulling low achieving students out of class for 
remedial instruction, usually provided as part of the Federal Chapter I 
(formerly Title I) program. Some researchers have found that the more time 
students spent in puil-out programs, the less they learned (Coulson et al., 
1977; Glass & Smith, 1977), Chapter I pull-outs have been criticized on the 
basis that instruction in the pull-out program is rarely well integrated 
with that provided by the regular classroom teacher (Kaestle & Smith, 1982; 
Johnston, Allington, & Afflerbach, 1985;. Also, much time is lost in 
transitions between regular and pull-out settings (Allington, in press), and 
pull-outs rarely increase the total instruction provided to students 
(Vanecko, Ames, & Archambault, 1980; Kimbrough & Hill, 1981), Similar 
problems of coordination and of supplanting of regular classroom instruction 
have been noted for special education pull-outs (Sargent, 1981) and when 
students qualify for special education and ranedial services, these problems 
multiply (Biman, 1981). 

While the problems of pull-outs are well recognized, alternatives to 
pull-outs have their own drawbacks. Having the remedial teacher work in a 
corner of the regular classroom, as is typical in in-class Chapter I models, 
provides no guarantee that coordination with the regular program will be 
enhanced (see Lee & Rowan, 1986). "In-class" models are often structurally 
identical co pull-out programs except that the remedial services are 
provided in a different location. Research comparing pull-out and in-class 
Chapter I/Title I models has found few differences in achievement effects 
(Archambault, 1987). 



One point on which there is agreement among virtually all reviewers of 
research on Chapter I and special education programs for students at risk is 
that program quality is far more important than the setting in which the 
program is implemented (Leinhardt & Pal lay, 19C3; Madden & Slavin, 1983; 
Archambault, 1987). Clearly, there can be effective pull-out programs and 
effective in-class programs to meet the needs of at-risk students. This 
paper examines research on classroom programs for elementary school students 
who are at risk for learning problems. The full range of alternative 
classroom organization models designed to meet the needs of low-achieving or 
heterogeneous classes is explored; the principal question this paper asks is 
how can the educational needs of all students be met by fundamentally 
restructuring the regular classroom, as opposed to adding on services 
outside of the regular classroom. A companion paper (Madden & Slavin, 1987) 
reviews research on effective pull-out programs for elementary students, and 
a third paper in this series by Nancy Karweit (1987) discusses effective 
programs for preschool and kindergarten students. 

Defining "At Risk " 

The definition of "at risk" applied in this paper is deliberately vague. 
Many factors tnay identify a student as being at risk for school failure; 
among these are low socioeconomic status, low measured ability, learning 
disabilities, or perhaps most importantly, learning problems early in the 
schooling experience* While each of these risk factors is conceptually 
distinct, all but the first (low SES) are difficult to distinguish in 
practice (see Ysseldike & Algozzine, 1982). With young students, low 
socioeconomic status may be used as a very rough means of identifying groups 



of students who are likely, on average, to experience learning problems, but 
of course many lov~SES students are quite successful in school. However, by 
the end of the first or second grade it i.r often clear that some students, 
regardless of SES, are beginning a process of falling further and further 
behind. With the exception of a few easily remediable conditions (such as 
poor eyesight or hearing), the fact that students are not achieving 
adequately in the early grades, for whatever reason, is the most important 
diagnostic indication that a students is at risk. Students who had early 
school problems are heavily overrepresented among school failures (Lloyd, 
1978), dropouts (Stroup & Robins, 1972), delinquents (Kelly, Veldman, & 
McGuire, 1964), and others who ultimately experience serious problems as 
they proceed through the school years. 

Scope of the Review 

The focus of this review is on specific^, comprehensive programs that may 
be beneficial for the achivement of students who are at risk for learning 
problems., A program is defined here as a set of procedures intended to be 
implemented as a total package and capable of being replicated by others. 
These features distinguish a program from a set of variables. For example, 
there is evidence that such variables as strong principal leadership, high 
expectations, and high time on-task are related to student achievement gains 
(see, for example, Brophy & Good, 1986). However, these variables do not in 
themselves constitute a program; a program would lay out a specific set of 
procedures that might incorporate these variables (and others) but v^ould do 
so in a structured and replicable format. The word "replicable" is 
particularly important in the definition of a program. It is often the case 



that a particular school or district has considerable success with at-risk 
students, but the success is due to the unusual skill or devotion of a set 
of individual principals or teachers rather than to any particular set of 
practices that can be easily transmitted to others. 

Review Procedures 

This review used a set of procedures adapted from best-evidence synthesis 
(Slavin, 1986) , a review method which combines features of meca-analytic and 
traditional narrative reviews. In essence, a priori inclusion criteria 
relating to internal and external validity were established and a broad 
literature search was conducted to identify studies which met those 
criteria. When possible, achievement effects of the qualifying studies were 
characterized in terms of affect size, the difference between experimental 
and control means divided by the control groups' standard deviation (Glass, 
McGaw, & Smith, 1981). However, the strengths and weaknesses of the 
individual evaluations are described to set the effects in context. These 
procedures are described in more detail in the following sections. 

Literature Search . Every effort was made to search the literature on 
instructional programs for elementary-aged students (Grades 1-6) as broadly 
as possible. Academic journals yielded a few relevant studies, bu': research 
on comprehensive instructional programs is r&rely reported in these 
journals. The mott fruitful source of information by far was reports 
submitted to the Joint Dissemination Review Panel (JDRP), a U.S. Department 
of Education panel which reviews evaluations of programs originally 
supported by federal funds. Programs whose effects are certified as valid 
by the JDRP are eligible for funding and dissemination through the National 




Diffusion Network (NDN) . What makes the JDRP submissions such a rich source 
of informs -ion for identifying effective programs is that with fe 
exceptions the prog -ams submitted to the JDRP were designed to be 
replicable; the main reason for a school district or research and 
development organization to go through Jl RP review Is to obtain funds to 
nationally disseminate a program, so data on programs that could not be 
easily transported to a new location would be unlikely to be submitted to 
the JDRP. Also, the JDRP rtjview process demands enough data to enable 
reviewers to determine program effects. The submissions for more than 300 
programs which successfully passed JDRP review were obtained from the 
Department of Eduacation. Also, the Northwest Regional Education Laboratory 
recently published an Effective Compensatory Education Sourcebook (Griswo?^.d, 
Cotton, & Hansen, 1986) which listed 116 district Chapter I programs 
nominated and selected as being particularly effective. Data and program 
descriptions were requested from these districts. Finally, ERIC documents, 
journal articles, and other reports were obtained in a general literature 
search relying in particular on references from other reviews (e.g., Ellson, 
1987). 

Substantive Inclusion Criteri^ In order to be considered getmane to the 
scope of this review, studies had to conform to a broad set of criteria: 

1. The programs evaluated had to be directed toward increasing the 
reading and/or math^maticj achievement of students in grades 1-6. Programs 
which were designed primarily for use in kindergarten or preschool are 
discussed by Karweit (1987), while those which overlapped the elementary 
grades (e.g., K-2, K~6, K~12) are included here. 
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2. The programs had to be implemented in regular classrooms. This 
excludes self-contained special education programs. Pull-out and tutoring 
programs are reviewed by Madden & Slavin (1987) . 

3. The programs had to be applicable to the education of at-risk 
students. This criterion was interpreted very broadly. Even if programs 
did not present evidence that they had been successfully evaluated in 
schools containing disproportionate numbers of at-risk students, they were 
included if they could in principal be applied to such students. Ho^^ever, a 
lack of data on effects for at-risk students is noted as a drawback in 
descriptions of effective programs. 

4. Programs specifically designed for non-English speakers are not 
included in this review. 

Methodol o gical Inclusion Criteria . Unfortunately, the methodological 
quality of most evaluations of instructional programs is very low. Many 
evaluations compare the achievement of students in experimental programs to 
that of students in "comparable" schools, with no evidence given that the 
comparison schools were in fact comparable before the treatments were 
administered. Others lack even this much control, claiming that since 
students achieved above "grade level expectations, " the program must be 
successful. 

Parhaps the most common reason for invalidity of evaluation design is the 
use of fall-to-spring gains in percentiles or, more commonly, normal curve 
equivalents (NCE's). A normal curve equivalents is a standard score with a 
mean of 50, a standard deviation of about 21, and a range of 1 to 99. NCE's 
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are therefore similar to percentile scores, but they are preferable because 
they use an equal-interval scale. Ir recent years, districts receiving 
Chapter I funds have been encouraged to report fall-to-spring gains in NCE's 
for their Chapter I recipients, which helps explain the popularity of this 
design. 

The rationale behind the fall-to-spring design is sensible in theory. If 
students score at, say, a mean NCE of 40 in the fall and 50 in the spring, 
then they have apparently gained relative to the national norming group, 
which should have remained at the same level on average. However, in recent 
years it has become app'^rent that this assumption is incorrect. Among 
Chapter I populations, NCE gains of ebout 8 points are routinely made from 
fall to spring (see Gabriel et al., 1985; Keesling, 1984). However, by the 
following fall, these differences have generally disappeared. The 
fall-to-spring gain is probably a statistical artifact, not a true effect of 
Chapter 1. For example. Chapter I-eligible students who do not actually 
receive Chapter I services make similar gains (Carter, 1984; Granick et al«, 
1984). Almost all of the "exemplary" Chapter I programs cited by Griswold 
et al. (1986) which provided multi-year data showed a pattern of 
fall-to-spring gains follovjed by equal spring-to-fall declines; the 
fall-to-fall or spring-to-spring gains were essentially nil over as many as 
five years. Nationeil data reported by Gabriel et al. (1985) indicate that 
the fall-to-spring effect is largest in the younger grades, diminishing from 
almost 9 NCE's in grade 2 to 4.5 in grade 12. 

Unfortunately, the cause of the fall-to-spring artifact is unknown (see 
Kennedy and Biman, 1986), and it is unlikely that the gains are equal 
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across tests, types of students, or other factors. Therefore, it is not 
even possible to assume that because a particular district reports NCE gains 
much more than eight that this is evidence of an effective program. There 
are examples of school districts in which f all-to^spring gains of 16 NCE's 
or more are reported each year, yet no growth from fall-to-fall or 
spring-to-spring is evident. 

Program evaluations which use fall-to-fall or spring-to-spring NCE or 
percentile gains as a criterion for program effects are beJ:ter than 
fall-to-spring studies, because the expected gains for annual testing are 
much smaller, in the range of 1-3 NCE's (Gabriel et al, 1983). Yet these 
have serious drawbacks as well. Some evaluations report annual gains for 
successive cohorts of students at a particular grade level (e.g., third 
graders in 1978-80 vs. third graders in 1980-81). Cohort differences can be 
affected by many extraneous factors, such as changes in district testing or 
promotion procedures. However, this design is preferable to a comparison of 
this year's gain to last year's for all students in a grade level, which 
confounds programs effects with retention/promotion, selection into or out 
of Chapter I or other special programs, and other factors. Comparing last 
year's gains to this year's for promoted students only is a much better 
procedure, but is rarely used; including non-promoted students inflates 
estimates of annual gain because these students are being compared ag in to 
the same grade-level norms after a year of instruction and growth. 

The inclusion criteria applied in the present review follow from the 
above discussion. The purpose of these criteria is to extract from a large, 
diverse, and messy literature the most convincing evidence available at 
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present on effective classroom programs for students at risk. The criteria 
are as follows. 

1. Convincing evidence of effectiveness had to be presented. 
Unfortunately, the nature of existing program evaluation data and of the 
sources of reports on such data precludes identif rlcation of programs which 
are not effective. The JDRP and examplary Chapter I programs only come to 
our attention because they were felt to be outstanding. A pattern of 
fall-to-spring gains cannot be used as evidence of effectiveness , but 
neither is it convincing evidence of ineffectiveness . In a usual review of 
the literature on experimental programs, effective and ineffective programs 
would be compared in .an attempt to identify consistent char acteristics of 
the more effective ones (see Cooper, 1984; Light & Pillemer, 1984; Slavin, 
1986). However, in the present case, the review focuses on those programs 
with convincing evidence of effectiveness, as distinct from others which may 
or may not be equally effective but do not present convincing data. A list 
of programs considered for inclusion but not included and reasons for 
exclusion is presented later in this paper. 

The remaining methodological inclusion criteria define what is meant by 
"convincing evidence of effectiveness." 

2. Eval^ Jns had to use control group designs with random assignment to 
groups and/or convincing evidence that comparison groups were initially 
equivalent in achievement. Studies which showed spring-to- spring gains of 
at least seven NCE's were also included, but in a separate category. This 
is about one-third of a population standard deviation ^approximately 21), 
but more like half of the standard deviation typical of Chapter I students. 
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who have a more restricted range of scores. One ;mportant category of 
programs excluded from the set of effective models is those which were 
involved in the Abt Follow- Through evaluation and were found to be no more 
effective than control groups overall , even though they were found to be 
effective in one or more sites. In many cases, data from the successful 
sites passed JDRP review, as the JDRP procedures do not require that data 
from all sites be submitted or considered. 

3. Evaluations had to use standardized, broadly based measures of reading 
and/or mathematics achievement. Thip excluded a small number of studies 
which used experimenter-made measures or only assessed a narrow set of 
reading or mathematics skills. 

4. The duration of evaluations had to be at least one semester (16 
weeks). Almost all JDRP and Chapter I evaluations cover at least a one- 
year period, but many evaluations of promising programs which have appeared 
in academic journals have involved much shorter treatment durations. To be 
considered a practical evaluation of a replicable program, a duration of one 
semester is a minimal requirement. 

Categories of Effective Models 

The principal results of the search" for instruct ionffl^mefhods with 
convincing evidence of positive effects are summarized in Tables 1-3. In 
each table, programs evaluated in studies which employed matched or (rarely) 
randomized control groups are listed . ;rst, with associated effect sizf/s. 
These effect sizes should be interpreted very cautiously and should not be 
compared across methods, but are included to characterize the relative 
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magnitudes of effects on different outcomes or for different jubpopulatiois 
within programs. 

The effective models are discussed according to categories of programs. 
Surprisingly, among the dozens of programs evaluated for possible inclusion, 
programs which presented convincing data fell into only three categories: 
continuous progress, individualized instruction, and cooperative learning. 
There are defined below. 

Continuous Progress 

By far the largest number of programs with convincing evidence of 
effectiveness are those falling in the category "continuous progress." The 
individual programs in this category vary in many particulars, but featui.*" 
common to all include the following. 

1. Students procede through a well-specified hierarchy of tkilJs. They 
are tested at each level to determine their readiness to move on to the n .t 
skill. Special procedures are established to assist students who fail to 
pass regular mastery assessments. These may include corrective instruction 
in small groups, tutoring, assignment to different groups, or special 
materials or activities. In these respects continuous progress programs 
resemble mastery learning and they are often described as such. Howe^rer, 
continuous progress programs are fundamentally different from the 
group-based mastery learning programs more commonly reen in elementary and 
secondary schools (see Slavin, in press a). Group-based mastery learning is 
discussed later in this paper. 
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2. Careful records are kept of each student's progress through the 
curriculum. Thess data are used to make grouping* remediation^ and other 
decisions, 

3. Most instruction is delivered by teachers to groups of students at 
the same instrucitonal level. This is primarily what differentiates 
continuous progress from individualized models. In continuous progress 
models, students progress at their own rates, but are primarily instructed 
by ceachers rather than by programmed or other individualized materials. In 
these methods, students are constantly grouped and regrouped according to 
their levels of reading and/or math .skills; this grouping is often done 
across grade lines, as *n nongraded j^^rograms (see Goodlad & Anderson, 1963) . 

j^ndividualized Instruction 

1. Students work primarily on programmed or other individualized 
materials, with teachers working mostly with individuals rather than groups 
of students. This is the primary difference between continuous progress and 
individualized models; many continuous progress models describe themselves 
as "individualized," but do not rely on self-instructional materials. 

2. As in continuous progress programs, individualized models keep 
careful records of student progress through a structured, hierarchical set 
of learning objectives. 

Cooperative Learning 

Although there are many methods based on principles of cooperative 
learning (See Sharan, 1980; Slavm, 1983), only two have been evaluated over 
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periods of at least one semester in terms of effects on standardized tests 
of reading and math: Team Assisted Individualization (TAI; Slavin, 1985) 
and Cooperative Integrated Reading and Composition (CIRC; Stevens, Madden, 
Sla^vin, & Famish, in press). In addition to structuring cooperation among 
students, both methods also incorporate elestants of continuous progress as 
well# The elements in common to TAI and CIRC are as follows. 

1. Studv^nts work in 4-5 member mixed-ability learning teams and receive 
certificates ox other recognition based on the learning of all group 
members. Students both help one another learn and assess one another's 
skills in preparation for tests or teacher assessments which will be taken 
without teammate help. 

2. Teachers instruct students (drawn from the different teams) who are 
at the same level in a hierarchy of skills. Most information comes from the 
teachers, not from peers or materials; teammates help one another master 
skills, but do not provide instruction to one another. 

3. Students are frequently assessed, first by teammates and then by the 
teacher. Specific corrective procedures fre provided for students who do 
not meet a preset level of mastery. 



Table 1 Here 
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Ch ar ac t v3 r is t ics and Outcomes of Effective Models 
Continuous ProgresiV 

A total of eleven programs were categorized as continuous progress models 
with convincing evidence of effectiveness. Seven of these used control 
group designs* 

Distar * Distar (Becker & Gamine, 1980) ic a program developed at the 
University of Oregon. Distar is unusual, even within the range of 
continuous progress programs. It provides teachers with very specific 
scripts to use in teaching reading and math and trains teachers in very 
specific methods, down to the level of how to use hand signals to elicit 
student responses and how to call on students. In describing the program, 
Distar' s developers tend to emphasize the sequential, hierarchical 
curriculum design, direct instruction ani rapid pace, and high frequency of 
student reponses (Becker & Cp.mine, 1980). However, what defines Distar as 
a continuous progress model is that students are taught in small groups that 
are homogeneous in skill level, are assessed frequently on their progress 
through a well-defined skill hierarchy, and are regrouped if necessary 
according to the results of these assessments. 

The evidence supporting the effectiveness of Distar for increasing 
student performance in certain skill areas is very strong. Distar was the 
only one of the nine major programs evaluated in the national Planned 
Variation Follow Through /studies conducted by Abt Associates (Stebbins, St. 
Pierre, Proper, Anderson, & Cerva, 1977) to have consistently positive 
effects on the achievement of disadvantaged students. The effect size 
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estimates which appear in Table 1 are from the Abt evaluation, pooling 
across cohorts (see Kennedy, 1978). As the table indicates, however, the 
effects of Distar appeared primarily on language and math computations 
tests, not on such higher^order skills as reading comprehension or math 
problem solving. Even on these scales, though, Distar students scored 
higher than all other Follow Through models. Note that these effects were 
brought about over three or four years of implementation, as the Abt 
evaluation focused on students who were in their respective programs from 
kindergarten or first to third grade. 

What makes the Abt evaluation results particularly impressive is that 
thi' evaluation involved many sites, matched control groups, and consistent 
measurement across non-Follcw Through and various Follow Through treatments. 
In reviewing instructional programs for this paper, there was always a 
nagging question of whether reports of successful evaluations sent to the 
JDRP or published in a journal are anomalies, perhaps one or two sites out 
of dozens in which the program was evaluated. In fact, in several cases 
sites which had positive results successfully passed JDRP review even though 
they were using methods which were unsuccessful at most other sites. The 
effects for Distar summarized in Table 1 are averaged across all sites vhich 
used the program, successfully or not. 

Recent research on Distar has found that the program's effects can be 
long-lasting. Following up students from an inner-city New York elem.-ntary 
school, Distar students exceeded control in high school graduation; about 
55% of the former Distar students graduated, compared to 34% of control 
students (Meyer, 1984). 
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Despite the well- documented and widely acknowledged positive effects of 
Distar, there has also been much criticism of this approach, principally on 
the basis of its use of scripted lessons and a perceived focus on rote 
skills rather than higher order, leaming-to-leam skills (see, for example, 
Calfee, 1986)* Many teachers and administrators resist Distar for similar 
reasons. Given this reality, it is important to note that Distar is only 
one of several successful continuous progress models, is the only one to use 
scripted lessona, and is one of only two (with ECRI) to emphasize the highly 
organized, teacher^centered classroom organization that many teachers find 
offensive. 

U-SAIL t U-Sail (for Utah System Approach to Individualized Learning) was 
developed and e^^aluated in Utah. It is a continous progress program quite 
different from Distar. In it, students procede through a hierarchical 
sequence of objectives at their own rates. Most instruction is given to 
small groups, but some individual work is included. While the teacher is 
instructing skill-level groups, other students are given independent, 
exploratory activities, with a particular emphasis on independent reading 
(See Hales, 1983 a, b) . 

The JDRP submission for U-SAIL presents data from only one elementary 
school in suburban Davis County, Utah. Evidence is provided to show that 
the school was similar to the control school for three years before U-SAIL 
was begun, but gained more than the control school afterwards. The data in 
Table 1 represent the degree to which U-SAIL students out-performed control 
students in reading and math from grades 3-5. Unfortunately, we have no 
idea how successful the program has been in other schools, or whether the 
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active involvement of the developer is needed to make the program 
successful. Also, no evidence is given that U-SAIL can be effective with 
disadvantaged or low-achieving students. 

PEGASUS . This program was apparently the first to pass JDRP review, as 

it was given the first JDRP number. PEGASUS is a classic continuous 

progress program. It organizes the reading program into 17 levels, spanning 

grades Students progress through these levels at their own rates, but 

are taught in groups appropriate to their current levels. Within each of 
» 

the levels is a continuum of skills which students must master. 

PEGASUS was originally developed in Tuscaloosa, Alabama, but data could 
only be obtained for evaluation of a "turnkey" site in rural Illinois. In 
this evaluation, schools in Bureau County were matched with similar schools 
in La Salle County for a one-year study. Results indicated significantly 
greater growth in reading for PEGASUS students at most grade levels on 
vocabulary as well as reading comprehension measures. However, while some 
of the students were poor, results for these students were not separately 
reported, so generalization to disadvantaged groups may not be warranted. 

ECRI. The Exemplary Center for Reading Instruction, or ECRI was, like 
U-SAIL, originally developed in Utah. However, it is quite unlike U-SAIL in 
procedures, more resembling Distar in classroom organization, rapid 
instructional pace, detailed and specific instructions for teachers, and 
frequent assessment of student progress. In ECRI, students are usually 
assigned to three reading groups. They receive instruction in their groups, 
and then work on materials at their own rates. ECRI emphasizes teaching 
specific word attack skills and proceding through large numbers of words in 
a short time. 
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Most evaluations of ECRI presented as part of the JDRP submission used 
fall-tc-spring designs. One study^ in a relatively low SES school in a 
suburb of Cincinnati* used a control group design, and the results of this 
study were used to compute the effect sizes shown in Table 1. Also, 
spring-to-spring data from a mostly low SES, all black school in rural 
Louisiana and from a Chapter I school in suburban Englewood, Ohio suggest 
that ECRI can be effective with disadvantaged and low- achieving students. 

Project INSTRUCT . Like PEGASUS, Project INSTRUCT is a classic continuous 
progress program. It was developed and evaluated in Lincoln, Nebraska. In 
the program, students are grouped according to skill levels, and may procede 
through a hierarchy of skills at their rates. An evaluation of the program 
by Novak (1975) compared schools which successfully implemented Project 
INSTRUCT to matched control schools, and found small positive effects in 
reading and math. 

GEMS. Goal-Based Educational Management System, or GEMS, is a 
diagnostic-prescriptive reading program developed in Utah. In this program, 
students procede at their own rates through 200 skill levels covering grades 
K-12. Each unit has a pretest and a posttest. According to placement tests 
or pretests, students are placed in appropriate instructional groups. 
Teachers use a variety of teaching strategies, following which students must 
attain a score of at least 80% to exit the unit. Those who do not achieve 
this score are given alternate materials and enough time as they need to 
ultimately pass. A computer management system helps teachers keep track of 
student progress. 
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An evaluation in rural Jordan, Utah found that GEMS students generally 
exceeded control in vocabulary, although differences in reading 
comprehension were small and were not found in grades 1 and 2. No evidence 
is given to suggest that the program might be effective with disadvantaged 
students. 

E arly Childhood Preventative Curriculum , This program is quite different 
from the others in Table 1 in that it is intended to be used intensively 
with high-risk first graders. These students are put into a special class 
during first grade and experience an individualized diagnostic-prescriptive 
program, in which students' individual strengths and weaknesses are 
identified and students are allowed to procede at their own rates. Most 
instruction is given in small, skill-level groups. 

ECPC was developed and evaluated in Miami, Florida. The evaluation 
compared the program to matched as well as random controls. Overall, 
positive effects were found on the Paragraph Meaning scale of the Stanford 
Achieveaient Test and (to a lesser extent) on the WRAT. 

Continuous Progress Programs Evaluated Using Year-to-Year Gains. In 
addition to the programs listed above, four additional continous progress 
models presented convincing evidence of year-to-year gains: Weslaco 
Individualized Reading/Language Arts Instruction and Staff Development 
Process (WILASD) . Conceptually Oriented Mathematics Curriculum (COMP), 
Coordinated Learning Integration — Middlesex Basics (CLIMB), and 
Outcomes-Driven Developmental Model (ODDM). The procedures used in these 
models generally fall within the range represented among the other continous 
progress programs. 
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A few additional continuous progress programs passed by the JDRP might 
also be effective, but did not present f-dequate evidence^ For example. 
Intensive Reading Instructional Teams (IRIT; JDRP No, 74-11) presented only 
fall-to-spring percentile gains* The Cranston Comprehensive Reading Program 
(JDRP NO. 82-28) ''sed a regression-discontinuity evaluation design which 
showed an increase in student scores the year the program was begun, but the 
district also changed test forms at the same time, making the gains 
difficult to ascribe to the program with any conf : "^.ence. 



Table 2 Here 



Individualized Instruction 

Table 2 lists individualized instruction programs which presented 
convincing evidence of effectiveness. Only three programs fall into this 
category and of these, only one used a control group design. 

Matteson Four-Dimens ional Reading Program . This program, developed and 
evaluated in a Chicago suburb, provides students with individualized 
learning packets through which students procede at their own rates. Some 
small- and large-group activities are used, and the individualized work 
occupies only 40-60% of the total readin[^ period, so the Matteson model 
cannot be called a "pure" individualized program. An evaluation ^jmparing 
matched schools found that fourth graders in the Matteson program gained 
more than control students in Stanford Achievement Test scores over a 
three-year period. However, no evidence is given that would suggest that 
this program could be successfully aplied to at-risk students. 
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AIRS. Andover's Individualized Reading System (AIRS) is another 
self-paced instructional model developed and evaluated in a well--to-do 
school district, Andover, Massachusetts. AIRS is a classic, comprehensive 
individualized approach which uses programmed self-instructional materials. 
Almost all language arts instruction is delivered through the materials with 
teachers, aides, and parent volunteers giving one-to-one assistance as 
needed. Comparisons of Stanford Achievement Test scores in student: cohorts 
before and after the AIRS program was introduced show clear gains which were 
maintained over several years. An AIRS evaluation in rural Rhode Island 
also found achievement gains each year after the program was installed. 
Again, no evidence is given to suggest that AIRS would be effective with 
disadvantaged or low-achieving students, 

STAMM . Systematic Teaching and Measuring Mathematics (STAMM) was 
developed and initially evaluated in Jefferson County, Colorado, a Denver 
suburb. Like AIRS, STAMM is a fully individualized instructional program, 
which relies primarily on self-instructional materials and one-to-one 
instruction by teachers and aides. Evaluations in Jefferson County and in 
an Atlanta suburb showed that students who experienced STAMM scored higher 
on the CTBS than did students in earlier cohorts. Special forms of STAMM 
exist for use with Chapter I and learning di..abled students, but evaluations 
with these or other at-risk students are not known to exist. 

Other Individualized Programs . What is noteworthy about Table 2 is not 
so much the programs listed there as the programs not listed. A large 
number of JDRP-approved programs used individualized models, and the broader 
educational literature has many studies of such methods. Yet very few of 
these present convincing evidence of effectiveness. 
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One individualized progrem which was carefully considered for inclusion 
in Table 2 is the University of Kansas Behavior Analysis Program (Ramp & 
Rhine, 1981). This program is generally acknowledged to have been the 
second most effective of the Planned Variation Follow Through models, after 
Distar, Also, two Behavior Analysis sites, in Trenton, New Jersey (JDRP No. 
77-139) and Waukegan, Illinois (JDRP No. 77-126) passed JDRP review with 
convincing, control-group evaluations (conducted as part of the overall Abt 
evaluation). However, site variability for Behavior Analysis was extreme; 
in some locations, especially New York City, Behavior Analysis students 
scored significantly worse than non-Follow Through students. Overall, 
Kennedy (1978) reports effect sizes of -.12 for reading comprehension and 
+.02 for language. Only in math computations is there positive evidence 
(ES=+.28), but math problem solving scores slightly favored the control 
group (ES=-.06). 

Another individualized model, the Responsive Early Childhood Education 
Program (RECEP; JDRP No. 77-154) also presents a paradox in that data 
submitted to the JDRP show some positive effects but the overall combined 
results from all Planned Variation sites did not. Se^'ural other 
individualized programs compared experimental to "matched" control groups 
without establishing that the control groups were initially equivalent 
(e.g., Randolph County Follow Through, JDRP No. 77-149), or presented only 
minimfil spring-to-spring gains (e.g. Personalized Classroom Management 
System, JDRP No. 78-170) . 

The JDRP programs which were excluded from Table 2 rarely provide data 
that could indicate that the programs are not effective, as such programs 
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would be unlikely to pass JDRP review. However, in the broader education 
literature there have been several good-quality evaluations of 
individualized programs which have indicated few positive effects (see 
Horak, 1971; Mill<»r, 1976; Thompson, 1975). When positive effects of 
individualized programs have been found, they are often in studies with poor 
methodological characteristics. For example, Wang and Walbers^ (1983) 
claimed positive effects of the University of Pittsburgh's Adaptive Learning 
Environments Model (ALEM) presenting comparisons of "expected" scores based 
on national norms to scores of students in the ALEM program. However, the 
one study of this method to use a control group found no achievement 
differences (Wang & Birch, 1984). 



Table 3 Here 



Cooperative Learning 

Table 3 lists two cooperative learning programs, TAI and CIRC, both of 
which were developed and evaluated at the Johns Hopkins University Center 
for Research on Elementary and Middle Schools. All four of the studies 
which evaluated these models (plus four additional studies which had 
durations of less than a semester) us'-^d control group designs, includiag the 
only two studies in Tables 1-3 in which teachers were randomly assigned to 
treatments (Slavin & Karweit, 1985) • 

TAI, Team Accelerated Instruction or TAI (Slavin, 1985) combines 
cooperative learning with a continuous progress approach to mathematics 
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instruction. In this model, students are assigned to 4-5 aiember. 



mixed- ability learning teams, and they are also assigned to skill-level 
groups on the basis of a placement test. Teachers instruct the skill level 
groups on the concepts of mathematics, after which students return to theii. 
teams and work on self-instructional materials. Teammates frequently check 
one another's work against answer i*heets, assist one another with difficult 
problems, and prepare one another for quizzes. Quizzes are taken apart from 
the team area, and achieving mastery score on quizzes fldus points to 
students' "ream scores. At the end of each week, teams which exceed a 
pre-established criterion in terms of numbers of units mastered may earn 
certificates or other rewards. Specific procedures and para],lel assessments 
are provided for students who do not achieve at a mastery level. 

TAX has been evaluated in six field experiments, but only three of these 
met the one-semester duration requirement for inclusion in the Tables. Two 
of the three studies (Slavin & Karweit, 1985) are the only studies listed in 
Tables 1-3 which randomly assigned teachers to treatments. One of these 
took place in and around Hagerstown, Maryland, and the other in inner-city 
Wilmington, Delaware. The Hagerstown study compared TAI to a control group 
and to the Missouri Mathematics program (Good, Grouw. & Ebmeier, 1983), a 
whole-class instructional method which incorporates teaching behaviors 
derived from process-product research. The Wilmington study compared TAI to 
the Missouri Mathematics program only; and a study in a Baltimore suburb 
(Slavin, Madden, & Leavey, 1984) compared TAI to an untreated control group. 

Results of all three studies favored the TAI clcjses, with the largest 
effects in the Wilmington study. Separate analyses for mainstr earned 



-24- 




acrdemically handicapped students in the suburban study also found positive 
effects for these fltudonts. 

CIRC . Cooperative Integrated Reading and Composition (Madden et al«» 
1986; Stevens et al., in press) uses a combination oi mixed-ability, 
cooperative work groups and skill-bused reading groups to teach reading, 
language arts, and writing in the uppp.r elementary grades. In CIRC, 
students are assigned to pairs within their reading groups, and then the 
pairs are assigned to 4-5 member teams, so that the teams are heterogeneous. 
During reading periods, teachers work with reading groups over the course of 
each week to set a purpose for reading, introduce vocabulary, discuss the 
characters, setting, problems, and problem solutions in narrative stories, 
and so on. Students back in their team areas work in pairs on a prescribed 
series of activities, including reading basal stories to one another, 
identifying elements of story structure, predicting story outcomes, 
practicing vocabulary, word lists, and spelling, and writing in response to 
storiese. Each week teachers provide instruction in such reading 
comprehension skills as finding the main idea. Students earn points for 
their teams based on the sum of their individual performances on regular 
quizzes. The CIRC language arts program is based on a writing process 
model, with students working together in their teams to plan, draft, revise, 
edit, and ultimately publish compositions. Language mechanics instruction 
is provided in the context of students' writing. 

The one scmester-long study which evaluated CIRC found positive effects 
in comparison to matched control classes on CTBS reading comprehension, 
reading vocabulary, language expression, and language mechanics scales. 
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Separate analyses for mainstreamed academically handicapped students found 
strong effects on both reading scales but not on the language measures; for 
remedial reading students (mostly Chapter I), effects were similar to those 
in the full sample. 

Other Research on Cooperative Learning . While the four studies cited 
above are the only ones to use standardized reading and math measures at th 
elementary level in studies of at least a semester, several additional 
studies support the achievement effects of cooperative learning. Two 
shorter studies of TAX iound positive effects on math computations measures 
(Slavin, Leavey, & Madden, 1984), though a third found no diffe-^ences 
(Oishi, Slavin, & Madden, 1983). A twelve-week study of CIRC (Madden, 
Stevens, & Slavin, 1986) found positive effects on CTBS reading 
comprehension, reading vocabulary, language expression, and spelling scales 
Other cooperative learning methods, such as Student Teams- Achievement 
Divisions (STAD) and Teams-Games-Tournaments (TGT) have had positive effect 
in mathematics, language arts, and other subjects at the elementary and 
secondary levels (Slavia, 1983a, b) . In addition, all of the cooperative 
learning methods have had positive effects on such outcomes as race 
relations, acceptance of mainstreamed students, and self-esteem (Slavin, 



Considering that there are more than 400 programs passed by the JDRP, 
plus many others evaluated and reported in the educational literature, how 
is it that only sixteen programs in three categories are emphasized here? 
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Essentially, this paper took a conservative approach in identifying 
effective programs. Only programs with convincing evidence of effectiveness 
were listed, and even these are not all certain, as there is always the 
possibility that data submitted to JDRP or published in journals is taken 
from one or more selected sites and is not representative of all program 
sites. Yet it seems certain that there are -lOre than sixteen effective 
classroom programs. 

This section discusses the state of the evidence on well known or widely 
used models not represented among those lis 3d in Tables 1-3. 

Developmental/ Humanistic Models . One major category of programs not 
listed in this paper is models that might be categorized as 
"devel opmental/ human i rt i c. " This includes the open classroom, 
Piagetian-based models, and other types of programs in which students are 
given choices of activities appropriate to their developmental levels, and 
are encouraged to discover and experience language and mathematical concepts 
rather than to master a pre-established series of skills. 

One important example of the developmental/humanistic approach is the 
progr:4n developed by the Bank Street College of Education (Gilkeson, 
Smitbberg, Bowman, & Rhine, 1981). Several sites using the Bank Street 
Program passed JDRP review (JDRP No. 77-156). However, the overall effects 
of this program in the Planned Variation evaluation were about zero in 
reading and language and negative in math. The High Scope/Cognitively 
Oriented Curriculum (Weikart, Hohmann, & Rhine, 1981), a child-centered, 
Piagetian model which extends into the elementary grades, many of the 
elements of the Perry Preschool Program (Weikart, 1974), also has several 
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JDRP-approved sites (JDRT? No. 77-123), but came out near the bottom of the 
list in the overall Abt evaluations (Kennedy, 1978). What these findings 
may imply is that the developmental/humanistic models can be effective, as 
evidenced by their apparent success in neveral sites, but the evidence for 
this or evidence of what is required to ensure their success are essentially 
lacking. Also, it should be noted that critics of the Abt evaluation (e.g.. 
House, Glass, McLean^ & Walker, 1978) have argued that the use of 
standardized achievement tests may have biased the evaluation in favor of 
such highly structured programs as Distar and Behavior Analysis. However, 
the Abt evaluations focused on children who had been in their respective 
models three or four years. If they had any important effects on students' 
cognitive performance, a standardized test should have registered some (if 
not all) of those effects. 

Group-Based Mastery Learning . Another widely used approach to school 
improvement is group-based mastery learning (Block, 1984; Block & Anderson, 
1975; Guskey & Gates, 1985). The teacher assesses student learning at the 
end of a series of lessons. Those who achieve at a pre-established level of 
mastery (e.g., 80%) are given enrichment activities, while others are given 
corrective instruction designed to bring them up to the mastery criterion. 
Claims of substantial positive effects of group-based mastery learning on 
student achievement are based largely on very brief experiments. In a 
review of research on group-based mastery learning, Slav in (in press a) 
found that in studies of at least four weeks' duration there were no 
significant positive effects of mastery learning on standardized measures, 
and moaest effects (median ES=+.25) on experimenter-made measures. However, 
it is important to note that the concepts of mastery assessment and 
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corrective instruction are central to all three categories of effective 
programs emphasized in this paper, and some of the continuous-progress 
programs, such as GEMS and the Outcomes-Driven Developmental model, identify 
tuemselves as mastery learning models (see Levine, 1985). 

Principles of Effective Classroom Programs for Students at Risk 

The purpose of this paper is not so much to identify individual programs 
as to discover principles which underlie effective models for students at 
risk. The effective programs identified in an extensive review of the 
literature fell into three categories: continuous progress, individualized 
instruction, and cooperative learning. However, the three effective 
individualized instruction models were all developed and evaluated in middle 
class, non-disadvantaged areas in which few students could be categorized as 
"at risk" for learning problems. Also, the broader literature has not 
tended to support the use of individualized, programmed instructional 
models. AIRS, STAMM, and the Mattes on 4-D models may have unique features 
which could make them effective with at-risk students, but given the past 
history of research on individualized programs, this remains to be seen. 

It is in the remaining two categories that the greatest confidence can be 
placed* In the case of the continuous progress models it is important to 
note not only the quality of the individual studies but also the number of 
successful models which began from very different theoretical bases and 
ended up with similar programs. Some of these models describe themselves as 
"individualized" (e.g., U-SAIL), but in order to provide adequate 
instruction to all students, incorporated groupings of students according to 
skill levels. Others (e.g., GEMS, ODDM) describe themselves as "mastery 
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learning." but incorporated subgrouping of students to accommodate 
instruction to individual needs. Developers of Distar describe the model as 
"direct instruction," yet the skill-level grouping, frequent assessment, and 
progress through a structured hierarchy of skills have little in common with 
more recent class-paced forms of "direct instruction" such as the Missouri 
Mathematics Program (Good et al., 1983). 

Although very different in many respects, the effective cooperative 
learning programs listed in Table 3 have many features in common with the 
continuc'Us progress models. First, instruction in TAI and CIRC is directed 
toward snail, homogeneous groups of students, so instruction is at a level 
appropriate to student needs. Second, instruction comes from the teacher, 
not from writ^en materi.als or peers. Third, students' prgress is constantly 
assessed and specific procedures exist to remediate any small problems 
before they become ^arg^ ones. Finally, in TAI and in the reading 
comprehension component of CIRC, there is a structured hierarchy of skills 
which students must master. Cooperative learning methods add to these the 
opportunity for students to explain concepts to one another and help one 
another study, team incentives based on the learning of team members, and 
student responsibility for many elements of classroom management. This last 
feature may be particularly important in today's lean times; TAI and CIRC 
have always been evaluated in classrooms in which there are no aides or 
volunteers, while continuous progress and individualized models have almost 
always had additional personnel. 

What '^merges from an examination of the effective classroom programs is 
the following characterization: Consistently effective classroom programs 
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accommodate instruction to individual needs while maximizing direct 
instruction , and they frequently assess student progress through a 
structured hierarchy of skills. 

The first part of this statement, the need to accommodate instruction to 
individual needs while maximizing direct instruction, is also supported by 
research on ability grouping in elementary schools, recently reviewed by 
Slavin (in press b) . This review found that while assigning students to 
self-contained classes on the basis of general ability was ineffective, two 
forms of ability grouping did have considerable support in good-quality 
experimental studies. One was the Joplin Plan, in which students are 
regrouped for reading across grade lines into reading classes all operating 
at one reading level. This plan obviates the need for grouping within the 
reading class, giving teachers the opportunity to spend all of the reading 
period in direct instruction. Studies of the Joplin Plan and closely 
related forms of non-graded plans found consistent positive effects on 
standardized reading measures (median Eo=+.44). 

The second form of ability grouping found in the Slavin (in press b) 
review to be instructional ly effective was within-class ability grouping in 
math (median ES=+.34). Use of math groups, especially if the number of 
groups is kept to two or three, strikes a balance between accommodating 
instruction to individual needs and providing adequate direct instruction. 
Since there is a need for independent seatwork in math, grouping within the 
class to meet students' diverse needs and having the teacher present two or 
three lessons at different levels may be the most efficient use of 
instructional time. 



Slavin (1987) has argued that to make a meaningful difference in student 
achievement^ four elements of classroom organization must be simul taneou&ly 
addressed: Quality of instruction, appropriate level of instruction, 
incentive, and time. The continuous progress and cooperative learning 
methods address most of these elements, in particular in providing 
appropriate levels of instruction (by grouping students according to skill 
level) while maintaining an adequate quality of instruction (by having the 
teacher be responsible for direct instruction and by having a well thought 
out sequence of instructional objectives). Cooperative learning adds to the 
incentive element the use of team rewards based on team members' learning, 
and both types of programs emphasize effective use of time through careful 
structuring of classroom activities. 

The importance cf accommodating student needs while maintaining adequate 
direct instruction is perhaps greatest for at-risk students. Students who 
begin to experience difficulties in their early years in school are unlikely 
to be able to keep up with the clacs pace or grade level expectations. In 
response to this, schools provide remedial and special education for 
students who are falling behind. Yet these programs rarely accelerate 
students enough to enable them to catch up with their classmates (Madden & 
Slavin, 1983; Carter, 1984). Building classroom models able to meet diverse 
needs would seem to be a better approach to the problems of at-risk students 
than creating a parallel instructional system, which creates a large set of 
new problems in terms of coordination with the regular program, transitions, 
and so on (Johnston et al., 1985). 



ERIC 



-32- 

34 



Conclusions 



This paper reviewed all sources of evidence concerning classroom programs 



which have been or could be applied to accelerating the reading and math 
achievement of at-risk students. Among programs whose evaluations used 
control group designs with good evidence of initial experimental-control 
equivalence, study durations of at least a semester, and standardized 
measures of reading and math, two categories of programs emerged as 
particularly effective: Continuous progress and cooperative learning. 
These approaches have in common the use of instruction to small, skill-based 
groups; instruction from the teacher rather than from materials or peers; 
and frequent assessment of student progress through a hierarchical 
curriculum. 

However, while this review is based on the best evidence currently 
available concerning the achievement effects of practical classroom 
instructional models, it is important to reiterate that this evidence is far 
from definitive. Even the carefully selected programs emphasized in this 
review could prove to be ineffective in later research; as noted previously, 
it is aiwiiys possible that data submitted to the JDRP or to journals is from 
the best sites, not from a representative sample of implementations. It is 
also likely that many programs excluded from this review will prove to be 
effective or could be modified to be effective. Further, while it is of 
sc«ne use to know that Program X is effective in comparison to traditional 
methods, there is much more we must understand to make this information 
useful in informing us about program design and classroom instruction in 
general. What elements of Program X account for its effects? For whom is 
it effective? Does it hove important side effects? 
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We are still at ? primitive state in understanding effective instruction 
for students at risk. What is needed is a renewed focus cn development, 
evaluation, and component analyses of programs based on many models of 
teaching and learning. 

One ^'pproach to identifying effective, transportable instructional models 
for students at risk would be learning from the mistakes of the Planned 
Variation Follow Through evaluation but trying again to accomplish its 
objectives. A very small proportion of the enormo^^s Chapter I budget, for 
example, could fund development of promising models, evaluation and 
component analyses of such models, and ultimately evaluation of the models 
in new sites with multiple measures (not only standardized tests), with 
random assignment of programs to schools. In this way, we could finally 
achieve the ambitious goals of Chapter I to make a meaningful difference in 
the life chances of disadvantaged students. 

While we cannot conclude that the search for effective classroom programs 
for at-risk students is over, the findings of this review do have important 
implications for current practice* It identified several programs which 
have demonstrated potential for increasing the achievement of disadvantaged 
and low achieving students and identified program elements which seem to 
characterize effective models. The identified programs are all designed to 
be replicable and transportable; in most cases the developers have training 
staffs who work to help school districts successfully implement the models. 
However, there are two notes of caution that practitioners should bear in 
mind. First, all of the effective models identified in this review are 
complex, and are quite different from usual practice. Adequate provisions 
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for training, followup, and monitoring of project implfmentations is 
imperative. Any of these programs can fail (and most have failed) when 
inadequate resources have been devoted to implementation. Program 
implementations should start on a small scale, beginning with volunteers and 
only gradually expanding from a solid base of success. Second, districts 
should conduct their own evaluations of new programs, using random 
assignment or careful matching to compare program users and non-users* The 
fact that a pr^grou was effective elsewhere suggests but does not guarantee 
that i** will be successful in any particular district. 

The search for reliably effective programs for students who are at risk 
for school failure is a task of great 5-mportance. Tliis paper and companion 
papers by Madden and Slavin (1987) and Karweit (1987) are intended to sum up 
where we ar3 now, in the hope that we can learn from the past in helping 
direct future research, development, and evaluation into effective 
instruction for our neediest students. 
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Continue 



Program and Source Grades Description 

Subjects 

I. Programs Evaluated Using Control Group Designs 

DISTAR K-6 Teacher instructs small 

(Becker & >^arnine, Reading groups using highly 
1980; Abt; Math structured, scrip ed 

JDRP #77-122. lessons. Students fre- 

JDRP #80-50) quently asse *d. re- 

grouped. 



U-SAIL 

(JDRP #76-95) 



1-9 Combines continuous pro- 
Reading gress. individualized act- 
Math ivities. Adapts to exist- 
ing curriculum, materials. 



PEG ASUS- PACE 
(JDRP #1. 
JDRP #79-1) 



K-8 Students proceed through 
Reading 17 rdg levels, gr. K-8; 

frequently assessed & re- 
grouped. 



ECRI 1-6 
' (JDRP #74-48) Reading 



Teacher instructs small 
groups; frequent mastery 
checks 



Project INSTRUCT K-3 
(JDRP #75-37) Reading 



Cross-grade grouping 
according to rdg skills; 
sti'dents proceed through 
skills at own rate. 



GEMS 

(JDRP #79-2) 



K-12 Students work in small 
Reading groups or individually on 
materials at their own 
level. Frequent assess- 
ment, mastery tests, cor- 
rective instruction. 



Early Childhood 
' Preventative 

Curriculum 
^ ^ (JDRP #74-57) 



1 only Students identified as 
Reading high risk given intensive 
continuous progress, diag- 
nostic-prescriptive program 
with small group and in- 
dividualized activities. 



Table 1 
s Progress Programs 



Evaluation 



Effect Sizes 



Abt evaluation found positive 
effects in rdg and math in mo *• 
sites; JDRP-approved sites 
include Flint. MI; Dayton. 
OH; Flippin. AK; E. Las 
Vegas. NM; Uvalde. TX; 
Kingstrte. SC; Cherokee. 
NC; Washington. DC. 



Gr. K-3 
(4 yrs.) 



Rdg. Comp. 
Language 
Math Comp. 
Math P. S. 



+ .07 
+ .84 
+ .57 
+ .17 



Both time series and control 
group comparisons show clear 
effects in reading and m^th 
in suburban school near Salt 
Lake City. Not disadvantaged . 

Matched control group design 
found convincing effects in 
reading in rural Princeton. 
IL; seme students were Title I» 
but mostly not disadvantaged . 

Most evaluations fall-spring 
or spring-spring. One control 
group evaluation in suburban 
Chapter I school near Cincinnati 
found convincing effects. 



Gr. 


3- 


5 




Reading 


+ 




(2 


yrs 


.) 




Math 


+ 


27 


Gr. 


2 


+ 


72 


Rdg. Voc. 


+ 


53 


Gr. 


3 


+ 


80 


Rdg. Comp. 


+ 


.61 


Gr. 


4 


+ 


.50 








Gr. 


5 


+ 


.39 








Gr. 


6 


+ 


.43 








Gr. 


4 






Rdg. Voc. 


+ 


.51 










Rdg. Comp. 


+ 


.39 



Compared students in Lincoln. 
NE. schools that had success- 
fully implemented program to 
matched schools. 

Not disadvantaged . 

Compared project schools with 
matched controls in suburb of 
Salt Lake City 

Not disadvantaged . 



Gr. 2 



Gr. 
Gr. 
Gr. 
Gr. 
Gr. 
Gr. 



+ .11 
+ .00 
+ .41 
+ .23 
+ .44 
+ .39 



Word. Knwl. 
P ading 



Rdg. 
Rdg, 



Voc. 
Comp. 



+ .30 
+ .23 



+ .T9 
+ .14 



Compared project schools to 
matched controls in Miami. FT 



Gr. 1 



Par. Mng. 

WRAT 



+ .95 
+ .28 



Table 1, continued 



Program and g>ource Grades 



Description 



S ubjects 

!!• Programs Evaluated Using Year-to-Year Gains 



WILASD 

(JDRP #83-2) 



COMP 

(JDRP #74-114) 



aiMB 

(JDRP #81-44) 



Outcomes-Driven 
Developmental 
Model (Mamary 
& Rowe, 1985; 
JDRP #85-7) 



1-3 Combines continuous 
Reading progress, whole class, 
and individualized 
instruction. 



1-8 Students taught ir small 
Math groups, progress through 
25 levels in 8 yrs. 



1-12 Diagnostic-prescriptive, 
Reading continuous progress. 
Math Title I tchrs mostly 

consult, do some pullout* 

1-8 Students grpd across grades 
Reading according to skills. Fre- 
Math quent mastery assessments, 
corrective instruction. 
Uses cooperative learning 
during independent practice. 



Evaluation 



Compared cohorts in mostly 
Hispanic, low income school 
in Weslaco, TX before pro- 
gram begun and and then 
several years after. Clear 
differences in scores. 

Fall-to-fall trends in NCE's 
show clear gains in math, 
Columbia, MO - 
Not disadvantaged . 

Grade 4-6 Title I students in 
suburban Middlesex, gained 
significantly in fall-to-fall 
rdg and math NCE*s. 

Gains over several years in 
percent of students scoring 
above grade level, Johnson City, 
NY. Mostly not disadvantaged . 
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Table 2 

Individualized Instruction Programs 



. . ^?,.o1..a^^r.n Effect Sizes 
Program and Source Description fcvaiuation 

Subj ects 

I. Programs Evaluated Using Control Group Designs 

Matteson Four-Dinr 3-8 Students work on indiv- Compared project classes to Gr. 3-6 Reading +.57 

ensional Reading Reading idualized learning pack- matched control classes in U yrs.; 

Program ets 40-60% of rdg time. suburban Chicago. 

(JDRP #77-109) !!£l disadvantaged . 



II, Programs Evaluated Using Year- to- Year Gains 



AIRS 

(JDRP #74-25) 



STAMM 

(JDRP #76-87) 



1-6 Students mostly ''ork on 
Reading individualized materials, 
some small group work. 



K-8 Structured, individualized 
Math program with programmed 
materials. 



Comparison of spring testing 
shows cohort gains in sub- 
urban Andover, MA. 
Not disadvantaged . 

Evaluations in suburban 
Denver and suburban Atlanta 
show gains in spring NCE*s. 
Not disadvantaged . 



1 
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Table 3 

Cooperative Learning Programs 



Program and Source 



Grades 
Subjects 



Description 



I. Programs Evaluated Using Control Group Designs 



Team Accelerated 3-6 
Instruction Math 

(Slavin et al., 

1984; Slavin & 

Katveit, 1985: 

JDRP #8A-5) 

Cooperative Integrated 3-5 
Reading & Composition Reading 

(Stevens et al.. Writing 

in press) 



Students work on program- 
med materials in mixed- 
ability teams whi' e tchrs 
teach same-ability teach- 
ing groups. 



Students work i^ mixed- 
ability teems vhile tchr 
teaches rrading groups. 



Evaluation 



In three studies, two with 
random assignment, one matched, 
TAI classes exceeded control. 
One study in rural MD» one in 
suburban MD, one in urban DE. 



CIRC classes exceeded matched 
control classes in study 
in suburbs of Baltimore. 



c 
r 
I 

( 
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1 



Effect Sizes ^ 



Full Sample Math Comp. +.50 

(Or. 3-6) Math C&A +.06 

Mainstreamed Math Comp. +.38 

Students Math C&A +.47 

Full Sample Rdg. Voc. +.12 

(Or. 3-4) Rdg. Comp. +.35 

Oral Rdg. +.54 

Language +.30 

Remedial Rdg. Voc. +.26 

Students Rdg. Comp. +.40 

Oral Rdg. +.71 

Language +.37 

Mainstreamed Rdg. Voc. +.90 

Students Rdg. Comp. +.99 

Language .00 
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